Learning Disjunction of Conjunctions

نویسنده

  • Leslie G. Valiant
چکیده

The quest ion of whether concepts expressible as d is junc t ions of conjunct ions can be learned from examples in polynomial t ime is i nves t i ga ted . Pos i t i ve resu l t s are shown fo r s i g n i f i c a n t sub­ classes tha t a l low not only p ropos i t i ona l p re ­ d icates but also some r e l a t i o n s . The algori thms are extended so as to be provably t o l e ran t to a ce r ta i n q u a n t i f i a b l e e r ro r ra te in the examples data. I t i s f u r t he r shown tha t under ce r ta in r e ­ s t r i c t i o n s on these subclasses the learn ing a lgo­ r i thms are w e l l su i ted to implementation on neural networks of threshold elements. The possib le im­ portance of d is junc t ions of conjunct ions as a know­ ledge representat ion stems from the observations tha t on the one hand humans appear to l i k e using i t , and, on the o ther , tha t there is c i rcumstan t ia l evidence tha t s i g n i f i c a n t l y la rger classes may not be learnable in polynomial t ime. An NP-completeness r e s u l t corroborat ing the l a t t e r is a lso p re ­ sented. 1 . In t roduc t ion The a b i l i t y of humans to learn new concepts is remarkable and myster ious. L i t t l e progress has been made in the problem of s imula t ing t h i s process by computer. Worse s t i l l there is l i t t l e agreement on what r e a l i s t i c spec i f i ca t ions of the desired behavior of such a s imula t ion might look l i k e . The approach taken to t h i s problem in a previous paper [13] was to appeal to computational complexity theory to provide some guidel ines on t h i s l a s t quest ion . In p a r t i c u l a r , var ious classes of prop­ o s i t i o n a l concepts were explored tha t could be learned from examples, or sometimes o rac les , in a polynomial number of computational s teps. Pre­ sumably b i o l o g i c a l systems cannot learn classes tha t are computat ional ly i n t r a c t a b l e . The o v e r a l l model was h i e r a r c h i c a l . A concept to be learned was expressed as a p ropos i t i ona l expression in terms of concepts already known. I f t h i s expression was too complex it was computation­ a l l y i n f eas i b l e to learn i t . The maximal complex­ i t y of such expressions tha t could be learned feas ib l y was s tud ied . That i nves t i ga t i on pointed to one p a r t i c u l a r concept class tha t espec ia l l y warranted fu r t he r i nves t iga t ion those tha t could be expressed as a ♦Supported in pa r t by Nat ional Science Foundation grant MCS-83-02385. d i s j unc t i on o f conjunct ions, ca l l ed d i s j unc t i ve normal form (DNF). The a t t r a c t i o n of t h i s class is tha t humans appear to l i k e i t f o r represent ing knowledge as is evidenced, f o r example, by the success of the product ion system paradigm [6] and of Horn clause log ics [ 8 ] . Our inves t iga t ions sug­ gest tha t s i g n i f i c a n t l y larger classes may not be learnable in polynomial t ime. Hence t h i s class may tu rn out to have a cen t ra l r o l e . This paper is organized as f o l l ows . In Sec­ t i o n 2 the formal framework w i l l be described f i r s t . I t w i l l then be shown tha t s i g n i f i c a n t sub­ classes of p ropos i t i ona l DNF expressions can be learned in polynomial time j u s t from examples. Whether the whole class can be so learned remains an open problem. The sect ion fo l low ing tha t w i l l discuss how the above resu l t s can be extended to al low v a r i ­ ables and re l a t i ons in the manner of predicate ca lcu lus . The extension is to a very r e s t r i c t e d par t of the predicate calculus where there is only e x i s t e n t i a l q u a n t i f i c a t i o n and a l i m i t e d number of va r i ab les . In Section 4 the question as to whether the above a lgor i thm can be made robust to er rors in the data is discussed. I t is shown tha t the learn ing algori thms can indeed be made robust to a small ra te of even mal ic ious ly constructed e r ro r s . The s u i t a b i l i t y of our learn ing algori thms to implementations on d i s t r i b u t e d " b r a i n l i k e " models of computation is the subject of Section 5. In the p ropos i t i ona l case, under ce r ta in na tu ra l r e s t r i c ­ t i o n s , such implementations are immediate, but t h i s does not appear to extend to the case al lowing r e ­ l a t i o n s . F i n a l l y in Section 6 we observe tha t the l i m i t s o f l e a r n a b i l i t y are not fa r o f f . Even i f i t i s known tha t a s ing le pure ly conjunct ive expression explains f i f t y percent of the occurrences of a con­ cept , d iscover ing or approximating t h i s conjunct ion may be NP-hard. The word learn ing has been used in numerous senses in phi losophy, psychology and A I . A usefu l d i s t i n c t i o n to make in t h i s context is whether the ava i lab le data from which the learn ing or induc t ion is to be made is consistent w i t h many var ied hypoth­ es i s , or essen t i a l l y w i t h j u s t one. In the former s i t u a t i o n , which i s o f ten t a c i t l y accepted in the philosophy of science, and also in machine learn ing

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Disjunctive normal random forests

We develop a novel supervised learning/classification method, called disjunctive normal random forest (DNRF). A DNRF is an ensemble of randomly trained disjunctive normal decision trees (DNDT). To construct a DNDT, we formulate each decision tree in the random forest as a disjunction of rules, which are conjunctions of Boolean functions. We then approximate this disjunction of conjunctions with...

متن کامل

A Unified Account of Conjunction and Disjunction Fallacies in People’s Judgments of Likelihood

This paper describes a simple continuous-valued logic which aims to explain the occurrence of both conjunction fallacies (where a conjunction AandB is judged more likely than a constituent A) and disjunction fallacies (where a disjunction AorB is judged less likely than a constituent A) in people’s judgments of likelihood for simple and complex events. In this model both these fallacies are the...

متن کامل

Learning DNF by Approximating Inclusion-Exclusion Formulae

Probably Approximately Correct learning algorithms generalize a small number of examples about an unknown concept into a function that can predict a future observation. More formally, let X and Y be the instance and outcome spaces, respectively. Then a PAC algorithm observes randomly drawn examples (x; f(x)) about an unknown concept f : X ! Y . These examples are independently and identically d...

متن کامل

Logical vs. Natural Language Conjunctions in Czech: A Comparative Study

This paper studies the relationship between conjunctions in a natural language (Czech) and their logical counterparts. It shows that the process of transformation of a natural language expression into its logical representation is not straightforward. The paper concentrates on the most frequently used logical conjunctions, ∧ and ∨, and it analyzes the natural language phenomena which influence ...

متن کامل

Σ 1 Formulae in Σ L

The essentially Σ1 formulae of ΣL are exactly those which are provably equivalent to a disjunction of conjunctions of 2 and Σ1 formulae.

متن کامل

Agnostic learning of disjunctions on symmetric distributions

We consider the problem of approximating and learning disjunctions (or equivalently, conjunctions) on symmetric distributions over {0, 1}n. Symmetric distributions are distributions whose PDF is invariant under any permutation of the variables. We give a simple proof that for every symmetric distribution D, there exists a set of n (1/ǫ)) functions S, such that for every disjunction c, there is ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1985